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Abstract 

It is widely assumed that 0{m + Iga) is the best one can do for finding a pattern of length 
m in a compacted trie storing strings over an alphabet of size cr, if one insists on linear-size data 
structures and deterministic worst-case running times [Cole et al., ICALP'06]. In this article, 
we first show that a rather straightforward combination of well-known ideas yields O(m-l-lglgtT) 
deterministic worst-case searching time for static tries. 

Then we move on to dynamic tries, where we achieve a worst-case bound of 0{m + i^ig f^^ ) 
per query or update, which should again be compared to the previously known 0(to -|- Igcr) 
deterministic worst-case bounds [Cole et al., ICALP'06], and to the alphabet independent 0{m+ 
y'lg n/ Iglg n) deterministic worst-case bounds [Andersson and Thorup, SODA'Ol], where n is 
the number of nodes in the trie. The basis of our update procedure is a weighted variant of 
exponential search trees which, while simple, might be of independent interest. 

As one particular application, the above bounds (static and dynamic) apply to suffix trees. 
There, an update corresponds to pre- or appending a letter to the text, and an additional 
goal is to do the updates quicker than rematching entire suffixes. We show how to do this 
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in 0(lglgn -|- ig^ig ) time, which improves the previously known O(lgn) bound [Amir et al., 
SFIRE'OSl. 



1 Introduction 

Text indexing is a fundamental problem in computer science. It requires storing a text of length n, 
composed of letters from an alphabet of size a, such that subsequent pattern matching queries can 
be answered quickly. Typical such pattern matching queries are (1) existential queries (deciding 
whether or not the pattern occurs in the text), (2) counting queries (determining the number of 
occurrences), and (3) enumeration queries (listing aU positions where the pattern occurs). The text 
is either static, or can be modified by appending new letters. 

Weh-known static text indexes are suffix trees [l6] and suffix arrays [15] . The former admit, in 
their plain form, 0(m Igo") existential and counting queries for a pattern of length m, while the 
latter achieve 0{m + Ign) time with the help of two additional arrays of size n storing the lengths 
of longest common prefixes needed during the binary search. With perfect hashing [10|, suffix trees 
achieve 0{m) time, but then the 0{n) preprocessing time is only in expectation. Enjoying the 
best of both worlds, the suffix tray of Cole et al. [o] achieves 0{m + Igo") searching time, with a 
linear space data structure that can be constructed in 0{n) deterministic time for a static text. In 
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the dynamic setting, suffix trees can be updated in amortized expected constant time, where the 
amortization comes from the need to locate the node which should be updated, and expectation 
from the hashing used to store outgoing edges. If we insist on getting worst-case time bounds, a 
recent result of Kopelowitz 14 allows updates in ©(Iglgn + lglga) worst-case (but still expected) 
time. The suffix trists of Cole et al. [o] achieve a deterministic bound of 0{m + Igo") for searching, 
with a linear space data structure that can be updated in 0{f{n, a) + lg a) deterministic and worst- 
case time, where f{n,a) is the time required to locate the edge of the suffix tree which should be 
split, and the best bound on f{n,a) known so far for the general case is O(lgn) [2]. In all cases, 
0{occ) additional time is needed for enumerating the occ occurrences. 

In all of the tree based structures mentioned in the previous paragraph, the crucial point is how 
to implement the outgoing edges of the tree such that they can be searched efficiently for a given 
query character. This is the general setting of trie search, and in fact, we can and do formulate 
our results in terms of tries, and view suffix trees as one particular application. In this setting, it 
is worth mentioning the result of Andersson and Thorup |4|, who show how to update or search 
the trie in 0{m + y^lgn/lg Ign) deterministic worst-case time, for n being the number of stored 
strings. In this article, however, we focus on alphabet- dependent running times, as outlined next. 



1.1 Our Result and Outline 

Our main results are summarized in the following theorems. We work in the standard word RAM 
model, where it is assumed that the memory consists of r2(lgn)-bit cells that can be manipulated 
using standard C-like operations in 0(1) time. 

In tries, we support stronger forms of existential queries, namely prefix queries (deciding whether 
or not the search pattern is a prefix of one of the stored strings), and predecessor queries (returning 
the largest string stored that is no less than the query pattern): 

Theorem 1. A compacted trie storing n static strings over an alphabet of size cr can be stored in 
0{n) space (in addition to the stored strings themselves) such that subsequent prefix- or predecessor 
queries can be answered in 0{m + Iglga) deterministic worst-case time, for patterns of length m. 
This data structure can be constructed in deterministic 0{n) time, for a = n'^^^\ 

We note that while for prefix queries the "+lglgcr"-term would in principle not be necessary, for 
the supported predecessor queries it certainly is. As one application, we mention suffix trees [18]: 

Corollary 2. Given a static text of length n over an alphabet of size a = n'^^^\ we can build in 
0{n) time a linear-space data structure such that subsequent on-line pattern matching queries can 
be answered in 0{m + Iglg cr) deterministic time, for patterns of length m. 

In the dynamic setting we get the following result: 

Theorem 3. We can maintain a linear-size structure for a trie storing strings over an alphabet of 
size a under adding a new string of length I in deterministic worst-case time 0(£+ j^j^^^) so that 

subsequent on-line prefix- or predecessor queries can be answered in 0{m + i^i^f^^ ) deterministic 
time, for patterns of length m. 

This result can be formulated in terms of suffix trees, too: 
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Corollary 4. We can maintain a linear-size structure for a text over an alphabet of size a = n'^(^) 
under prepending a new letter in deterministic worst-case time 0{f{n,a) + jjf |g\g^ ) so that on- 
line pattern matching concerning the current text can be answered in 0{m + i^i^f^^ ) deterministic 
worst-case time, for patterns of length m. f{n, a) is the cost of updating the suffix tree for a text n 
over an alphabet of size a after prepending a letter. 

It was already noted that the currently best bound [2] for deterministic worst-case suffix tree 
oracles is /(n, a) = Ign. Though not being the main goal of this article, in the appendix we show 
a better suffix tree oracle, giving us truly better total running times: 

Theorem 5. There is a deterministic worst-case suffix tree oracle with f(n,a) = 0(lglgn + 
ig^ ig o- \ 

Ig Ig Ig cr / ■ 

1.2 Technical Contributions 

Our main technical novelty is a weighted variant of exponential search trees p], which we term 
wexponential search trees. The original exponential search tree achieves 0(lglgn • j^f^f^) search 
and update times for n elements over a universe of size u. Our weighted variant generalizes this 
to 0(lg^^^ • |g^g^^„ ) ; where w is the weight of the searched element, and W is the sum of all 
weights. The advantage of this is that in a sequence of t hierachical accesses to such data structures, 
where the old "w" is always the new 'W, the sum telescopes to 0(lglgn • j^^^f^) instead of 

0(t-lglg n • igf gf g ) • W^hile this general idea is pervasive in data structure, we are not aware of any 
previous application in the doubly-logarithmic setting. 

2 Preliminaries 

This section introduces definitions and known results that form the base of new data structure. 

2.1 SufRx Arrays 

Let T = ti . . .tn he a text consisting of n characters drawn from an ordered alphabet S of size 
cr = The substring of T ranging from i to j is denoted by Tj .j, for 1 < i < j < n. The substring 
Tj..„ is called the i'th suffix of T and is denoted by T*. 

The suffix array SA[1, n] of T is a permutation of the integers in [1, n] such that T^^i^~^'\ <[g^ 2^SA[«] 
for all 1 < i < n. In other words, SA describes the lexicographic order of the suffixes. The suffix 
array can be built in linear time for a = n^^^^ |13j . 

2.2 Suffix Trees 

The suffix tree ST is a compacted trie over all the suffixes of T. By concatenating all edge labels 
on a root-to-node path, its nodes spell out substrings of T. If we append a unique character $ E 
to T, then every leaf spells out exactly one suffix of T$. Indeed, if the leaves are visited in a 
lexicographically driven depth first traversal of ST, then the suffixes are visited in the order of the 



3 




Figure 1: Schematic view of a suffix tray. 



suffix array SA. This imphes that every node v of ST represents an interval [iv^fv] in SA, whose 
endpoints and r„ can be stored with v: if the root-to-f path spehs out a E S*, then v stores two 
integers iv and r^, such that T^^i^-"] is the lexicographically first suffix starting with a, and T^^l"^^] 
is lexicographically last. We call these the suffix array intervals. 

Given the suffix array, the suffix tree can be constructed in 0(n) time by repeatedly inserting 
the suffixes in lexicographic order, with the help of the so-called LCP-array [II] . 

Given the suffix tree for a text T, constructing the suffix tree for a text aT, for any letter a, 
requires adding just one additional suffix, which in a compacted trie means splitting at most one 
edge into two parts and adding a single edge outgoing from the middle node (or one of the already 
existing nodes, if no new middle node was created). We call this splitting an edge. In the dynamic 
setting we will assume the existence of a suffix tree oracle telling us which edge should be split after 
prepending a, and denote the time taken by a single such call by /(n, a). The oracle is assumed to 
be deterministic, but the bound on f{n,a) can be amortized if we aim at getting amortized time 
bounds. 

2.3 Suffix Trays 

The suffix tray |9| is a blend of the suffix tree and the suffix array. The idea is to discard small 
subtrees of the suffix tree and only keep the upper part where nodes have sufficiently many leaves 
below them. Then this upper part can be augmented with "expensive" information to enable 0{m) 
pattern searches if the pattern ends in the upper part. For patterns not ending in the upper part, 
one switches to binary suffix array search as outlined in the introduction. If the binary searched 
intervals are of size 0{a^^^^), this latter step costs only 0{m + Igcr) time. 

More precisely, we classify the suffix tree nodes into heavy and light as follows: a node is called 
heavy if it has at least a leaves in its subtree; otherwise it is called light. The heavy nodes form 
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a connected subtree (this is the upper part aluded to above) . Heavy nodes are further subdivided 
into branching and nonbranching, depending on whether they have at least two heavy children or 
just one. See also Fig. [l] 

Light children of heavy nodes store their corresponding suffix array interval. The heavy nodes 
store additional information as described next. Nonbranching heavy nodes v store a special pointer 
Pv to their only heavy child. Branching heavy nodes v store an array of size (T, where 
stores a pointer to the node w such that the first character on the edge {v, w) is a £ S. Since there 
are only O(^) branching heavy nodes, these arrays take only 0{n) space in total. 

Now a search for a pattern P[l,m] proceeds as follows. Assume inductively that we have already 
matched -P[l,«] and are currently at node v (we start at the root with i = 0). At branching heavy 
nodes v, we can move in 0(1) time to the correct child using the array A^. If the current node v is 
nonbranching internal, we first check if the first character on the edge {v,py) equals P[i + 1]. If so, 
we proceed with p^. Otherwise, we binary search among the children of v for the character + 1]. 
This will bring us to a light child w oi v, where we binary search SA within the suffix array interval 
[^«), ''^u)]- A similar binary search is performed if the array A^ directs us to a light node. Since there 
are at most two binary searches on arrays of size 0{a), the claimed running time of 0{m + Igo") 
follows. 

2.4 Suffix Trists 

The suffix trist [9] is a dynamic version of the suffix tray. The idea is, as in the suffix tray, to 
maintain the suffix tree, and store the outgoing edges in different ways depending on the size of 
the subtree. We assume that a suffix tree oracle is available, and hence on updating the sets on 
those edges. For nodes with a large number of leaves in their subtrees, we can afford to store the 
outgoing edges in a large but quick to both access and update data structure. For nodes with 
just a few leaves, we switch to a different representation, called Balanced-Indexing-Structure. The 
main difficulty is that the size of a subtree increases in time, and hence one needs to cleverly move 
from one representation to another. Even more challenging part of the problem is that we want 
the bounds to be worst-case, hence the transition between different representations must be done 
in an incremental fashion. 

We do not describe the details of the original solution, as our idea is slightly different. We just 
state that its time bounds are 0{f{n,a) + Igci) for updates and 0{m + Igu) for queries, both 
deterministic worst-case. 

2.5 Deterministic Hashing and Predecessor Data Structures 

We make use of the following result by Ruzic [17^ Theorem 3]. Note that even the 0{k\gk) 
construction time of Hagerup, Miltersen, and Pagh [12] would be enough for our purposes, but their 
implementation requires some compile-time constants not known to be efficiently computable, and 
does not seem to be significantly simpler. 

Proposition 6. In the word RAM model, a static linear-size dictionary on a set of k keys can be 
deterministically constructed in time 0{klg^lgk), so that lookups to the dictionary take time 0(1). 

Combining Prop, [g] with the classic j;-fast tries by Willard [19], we get the following: 
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Proposition 7. In the word RAM model, a static linear-size data structure on a set S of k sorted 
keys from a universe of size u can he deterministically constructed in time 0{k), so that subsequent 
predecessor queries Pred(x) := max{y < x \ y G S} can be answered in O(lglgn) time in the 
worst-case. 

Proof. Achieving 0(A;lgnlg^ lg(A;lgu)) construction time and 0{klgu) space is trivial, as con- 
structing the x-fast tree reduces to storing just klgu elements in a static dictionary. To speed up 
the precomputation and reduce the space to linear, we choose „ ^ evenly spaced keys, store 

them in an x-fast tree constructed in time 0{y^—^-^ Ignlg^lg/c) = 0{k), and use binary search 
to locate the predecessor between two adjacent chosen keys to answer each query. ■ 

We also make use of the result of Beame and Fich ^ Theorem 3] , who achieved optimal query 
time possible in the static setting. For the sake of concreteness, we will assume e = 1. 

Proposition 8. In the word RAM model, a static data structure on a set S of k keys from a 
universe of size u can be deterministically constructed in 0{k^~^'^) time and space, so that subsequent 
predecessor queries Pred(x) := max{y < x | y G S"} can be answered in 0( igfg^^„ ) time in the 
worst-case. 

Proof. In the original formulation the construction time and space were 0(/c^). We can improve 
on this with a constant number of indirection steps. More precisely, given a structure with 0{k^~^'^) 
construction time and space, we can implement a 0{k^~^2'j time and space structure by choosing 
y/k evenly spaced keys, storing them in the more expensive structure, and building a separate more 
expensive structure for each group of keys between two chosen elements. ■ 

The above structure can be dynamized using the method of Andersson and Thorup [4]. 

Proposition 9. In the word RAM model, a dynamic linear-size data structure on a set Sofk keys 
from a universe of size u can he maintained, so that subsequent predecessor queries Pred(x) := 
max{y < x | y G 5} can be answered in 0( /gig\^g" ) time, and new keys can he inserted in 0( /^ig\^g" ) 
time, where both bounds are deterministic worst-case. 



3 New Static Data Structure 

In this section we prove Theorem [TJ We store edges of the trie as pairs of the form {v,a), where 
w is a pointer to the source of the edge, and a G S is the first character on the edge from v to 
its corresponding child w. As secondary information we also attach a pointer to w with the pair 
(f,a). This way, matching a pattern P[l,m] reduces to repeatedly finding correct edges: assuming 
inductively that we have already matched -P[l,i] and are currently at node we check if the edge 
{v,P[i + 1]) exists, and move to that child if this is the case. 

We now describe how the edges are stored. A naive storage with the data structures from 



Sect. 2.5 would result in superlinear construction time. Hence, we need to introduce several levels 



of indirection. Like in the suffix tray (Sect. 2.3 ), we divide nodes into heavy and light, but this time 
with parameter s := 0(lg^ Igf'"): a node with at least s leaves below it is called heavy, otherwise it 
is called light. 
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To continue, we classify the edges of heavy nodes into two different types: (heavy,heavy) and 
(heavy,light). Here, the first component of each tuple refers to the type of the parent node, and 
the second one to that of the child. For each branching heavy node we store the outgoing edges 
of type (heavy,heavy) in a dictionary using Prop. [6| with the key being the first character on 
the edge. Since there are at most ^ heavy nodes with no heavy children, the total size of all 
dictionaries cannot exceed the sum of degrees in a tree on 2^ nodes, which is 0{^). Furthermore, 
the number of elements in each dictionary is at most a, so constructing all those data structures 
takes 0(" x Ig^ Igo") = 0{n) time. For nonbranching heavy nodes, we only have at most one edge 
of type (heavy,heavy), so each such node simply stores a special pointer to the only heavy child, 
which enables us to decide in 0(1) time if we need to continue matching there. 

Finally, we show how to handle the edges of type (heavy,light). For each (branching or not) heavy 
node we store all such outgoing edges using the data structure from Prop. [T] Using the structure, 
we can locate the light child we should descend to in 0(lglg cr) time. Then we binary search the 
suffix array using the suffix array interval stored at the child, taking additional 0(m + Igs) time. 
Summing up, the whole search process takes 0{m + Iglgo") time. 

3.1 Predecessor Queries 

As an additional bonus, the above data structure can be easily augmented to identify the lexi- 
cographic predecessor of P if it does not occur in the underlying text T. For this we need to 
construct additional predecessor data structures (again using Prop. [7]) storing all edges outgoing 
from a node. We also augment each internal node with a pointer to the lexicographically largest 
leaf in its subtree. 

For answering queries, we first try matching P as described above. Now imagine a search for 
P[i + 1] fails at node v. Then we query the new predecessor data structure stored at v. This 
identifies a child w v that is a prefix of the predecessor of P. From w we follow the pointer to 
the rightmost leaf in w's subtree; this is the predecessor of P. The time is 0{m + Iglgcr). 

4 New Dynamic Data Structure 

In this section we prove Theorem [3} we call the resulting data structure the dynamic compacted 
trie search structure. 

Although we want all bounds to be worst-case, we first start with an amortized version, and 
then show how to make it worst-case. In both cases, we apply the powerful exponential search tree 
paradigm of Andersson and Thorup [4]. In the amortized setting, this results in a fairly simple 
method. Making it worst-case is a nontrivial challenge, though. Fortunately, instead of using their 
original deamortization approach, we can follow a significantly simpler method of Bender, Cole, 
and Raman [6] . First we prove that without loss of generality it is enough to show how to maintain 
a tree of size 0{a). Then we develop a new variant of weighted multiway search trees, which we 
call wexponential search trees. Then we show how to use it to build the new data structure, first in 
the amortized setting, and then making the bounds worst-case efficient. 

The worst-case version of the structure gives us Theorem [3j while the simpler amortized variant 
achieves the following bounds. 
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Theorem 10. We can maintain a linear-size structure for a text over an alphabet of size a = 
j^O(i) yj^^gj, prepending a new letter in amortized time so that on-line pattern matching 

concerning the current text can be answered in worst-case time 0{m + i^i^^), for patterns of 
length m, where all complexities are deterministic. 



Proof. We maintain the suffix tree. As we aim for amortized time bounds, we can implement 
the suffix tree oracle using Weiner's algorithm. This requires storing for each node the reverse 
suffix links in a dynamic predecessor structure implemented using Prop. [9] and gives (amortized) 
/(n, o") = 0( /gig\^gCT )- Using such oracle we locate the edge and update it using the amortized 
version of our structure (Sect. 4.3) in 0{ l^^^f^^ ) time. ■ 



4.1 Reducing the Tree 

In this subsection we show how to reduce the general case of maintaining a tree of size n to 
maintaining a collection of smaller trees of size 0{a). More precisely, we prove the following 
lemma. 

Lemma 11. Maintaining a tree of size n can be reduced to maintaining a collection of smaller 
trees with linear total size and each tree being of size 0{a) such that each query concerning the 
original tree can be either answered immediately or translated into at most one query about one 
of the smaller trees, and each update in the original tree can be translated into a constant number 
of updates in some of the smaller trees. The former translation will be performed in deterministic 
worst-case 0{m + /g ig\g^ ) time, and the latter in 0( i|fig\g'^ )- 

While we aim for worst-case bounds, we start with an amortized version of the above formulation, 
which is easier to describe, and then explain how to deamortize it. In both cases the idea will be, 
again, to divide the nodes into heavy and light with parameter s := a, but now we slightly modify 
the definition so that all nodes with at least s leaves are heavy, all nodes with at most | leaves are 
light, and the remaining nodes can be either light or heavy. 

For all light nodes such that the parent is heavy, we maintain a separate smaller tree of size 
0(cj), and explicitly store its number of leaves. For each node v we store a dynamic predecessor 
structure implemented using Prop. [9] storing all of its children. If the node is branching, we keep 
all its heavy children in an array of size cr, and otherwise we simply keep a pointer to the unique 
heavy child of if any. Given a query, we simply use the arrays or pointers to descend as far as 
it is possible, and then use the dynamic predecessor structure to either locate the small tree where 
we should continue, or terminate. (So far, the ideas are similar to the static case.) For updates, 
each light node stores a pointer to the root of the small tree it belongs too. Then after adding a 
new leaf we follow the pointer to r and increase the counter there. As soon as it becomes equal 
to s, we traverse the whole subtree rooted there and make each node with at least | leaves heavy 
and updating pointers to the root for all nodes that have less of them. Observe that all new heavy 
nodes are nonbranching, but it might happen that r's parent becomes branching and we need to 
construct its array. The traversal takes 0{s) time, building the array can be done in 0{(t) time, 
hence the whole complexity is 0(s). We can amortize it by maintaining an invariant that each 
light node with a heavy parent has max(0, ^ — §) credits available, where i is the number of leaves 
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in the corresponding subtree. Whenever we insert a new leaf, we need to allocate one new credit. 
After each traversal we get rid of all light nodes with at least | leaves, hence we are free to use all 
s credits to amortize the expensive operation. 

Deamortizing the above method can be done by performing the traversal (and constructing the 
array, if necessary) incrementally over | insertions in the small tree rooted at r, starting when its 
counter reaches s. The whole procedure consists of 3 phases. 

1. We construct the array A for r's parent over | insertions, which requires allocating and 
initializing the memory, and then iterating through all of r's brothers. Note that while we are 
doing so, new brothers can appear, hence we should keep all children of a node in an unsorted 
linked list, and append each new child there. The speed of the initialization and iteration 
should be chosen so that we have enough time to process up to a elements. It might happen 
that more than one child of a node wants to start the iteration, so each node should have an 
unique process which constructs the array, and whenever an insertion occurs, we execute a 
constant number of steps at the parent's process. 

2. We run a depth-first search to determine the sizes of all subtrees rooted at the nodes of the 
small tree. The search is performed incrementally over | insertions, again, with the speed 
of the simulation adjusted so that there is enough time to process a tree of size |s, as new 
leaves can appear while we are still in the middle of the traversal, and we might already have 
gs leaves in the subtree at r to start with. 

3. We again traverse the tree rooted at r in a depth-first fashion, and make all nodes of size 
exceeding | heavy, where by size we actually mean the size computed in the previous phase. 
During this traversal we also update the pointers to the root of the small tree a given node 
belongs to, and rebuild all those smaller trees. More precisely, at node v we set its pointer 
and insert v into the corresponding smaller tree in the same step. Again, the speed of the 
simulation is adjusted so that we can process a tree of size |s over | insertions. Note that as 
the rebuilding is done in an incremental manner, it might happen that we will update the old 
smaller tree while the simulation is still running. Nevertheless, in such a case we will sooner 
or later encounter the place where the update occurred, and change the corresponding new 
smaller tree accordingly. Notice that all new small trees are of size less than s till the very 
end of the simulation. 

Each query requires probing at most 0(m) static dictionaries, one dynamic predecessor structure, 
and at most one small tree. Each update, on the other hand, requires inserting a new key to one 
dynamic predecessor structure, and updating a constant number of smaller trees, hence the claimed 
time bounds follow. The linear space bound follows from the observation that each of the original 
nodes occurs in at most two smaller trees. 

4.2 Weighted Exponential Trees 

In this section we prove the following theorem, which might be of independent interest. 

Theorem 12. There is a linear-size data structure that allows storing a collection of weighted 
sorted keys from an ordered universe of size u so that predecessor search takes 0{lg iglgf J'^ ) 
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time, where W is the current total weight of all elements, and w is the weight of the predecessor 
found. Inserting a new element of weight 1 takes 0{lglgW) time, and increasing by one the weight 
of an element of weight w takes 0(\g ^^^) time. All bounds are deterministic worst-case. 

Whenever we insert a new element, we get its handle in return, which then can be used to increase 
the weight by one. If we do not have this handle, increasing the weight requires performing a search 
first. 

The proof of the theorem is based on the beautiful idea of Bender, Cole, and Raman [6] , who have 
shown how to significantly simplify the deamortization presented by Andersson and Thorup [2]. 
We start with the amortized version of the theorem, and later show how to make it worst-case 
efficient. 

Let f{i) = [2^2) \ , so i = e(lglg/(^)). We define a weighted multiway search tree with the 
degrees increasing doubly-exponentially along any leaf-to-root path, which we call a wexponential 
search tree of level i, in the following recursive manner: 

1. the (explicitly stored) current total weight W of all elements is less than 2f{i + 1), and if 
W > 2f{i) the tree is proper, 

2. we store a static predecessor search structure (implemented using Prop.js]) containing a subset 
S = {ei, . . . , e\s\} of all elements, which we call the splitters, 

3. the remaining elements are split into Xq, . . . ,^|5| such that Cj is between and Xi, 

4. the total weight of all elements in {a} U Xi U {ci+i} exceeds f{£) — f{i — 1) for all i = 
l,2...,\S\-l, 

5. each Xi is stored in a wexponential search tree of level i — 1, which are called the children, 

6. for each i the predecessor structure stores a bidirectional link to the child storing Xi, and 
additionally a link to the leftmost child storing Xq is kept. 

Observe that if a weight of an element is at least 2f{i), it must belong to S. Note also that the 
definition requires that some of the Xj's may be empty. Furthermore, we can bound the size of S 
as follows: 

\s\ < ^ ^(.fiyii) = o(2(i)--(i)^) = o(2Uir) = oifhD). (i) 

Updating the structure will be done in a bottom-up order. In other words, we will assume that 
the children are valid wexponential search trees of level at most i — 1, and show how to ensure 
that their parent is a valid tree of level i. Inserting a new element of weight one or increasing the 
weight of some element by one might cause the total weight to become 2f{i +1). As soon as we 
detect such a situation, we split the tree into two by choosing an element ej from S. To choose this 
element we look at the sets of its predecessors Pi = XqU {ei} U . . . U {ej_i} U and successors 
Si = XjU{ej+i}U. . .U{e|5i}UXi5|. As the total weight is 2/(£ + l), and the weight of any Xi is less 
than 2f{i) (by conditions and [T ) , we can always select an i so that the weight of both Pi and Si 
is less than /(£+l) + /(£), but the weight of both PiU{ei} and {eJUSi is at least f{i + l)-f{e); 
see Fig. [2} We construct two new wexponential search trees of level £ containing all elements in Pi 
and Si, which is possible as their total weights are at most f {£ + !) + f{£) < 2f{£ + l). This requires 
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Figure 2: Finding a suitable new splitter Cj. The smaller of the two parts has size at least f{i + 
1) — f{£), and hence the larger one at most f{£ + l) + f{£). 



constructing static predecessor search structures containing ei, 62, ... , e^-i and e^+i, 6^+2, . . . , e|5|, 
respectively, and making each wexponential search tree of level £ — 1 a child of the former or the 
latter new tree. Notice that we don't have to rebuild the smaller trees, as simply redirecting the 
pointers to already existing structures is enough. Then we look at the parent of the structure that 
we are splitting. If there is none, we simply create a new proper wexponential search tree of level 
£+1 with just one splitter e^. Otherwise we add to its set of splitters, and store pointers to the 
two newly created trees of level £ in its predecessor structure, which needs to be rebuilt, or refreshed. 
The whole splitting and refreshing process is a very local procedure, as it requires rebuilding the 
static predecessors structures only for the tree and its parent, while the descendants and further 
ancestors are kept intact. 

Now we can describe how to query and update the wexponential search tree. 

predecessor search: First we use the static predecessor search structure, which takes 0{j^^^). 
If the query element belongs to S, we are done. Otherwise we recurse in the smaller structure. 

insert: Using the static predecessor structure we locate the smaller structure the new element 
belongs to, and insert it there recursively. Then we increase W by one and split the tree, if 
necessary. For each new element we allocate a record storing a link to the tree where we have 
used it as a splitter, and return a pointer to this record. Whenever some static predecessor 
search structure is rebuilt, or an element is moved up to the parent, we update the record. 

increasing: We locate the tree where the element is a splitter using the record in constant time. 
Then we increase the total weight by one and split the tree, if necessary, and move up to its 
parent. 

To analyze the complexity of this implementation, we first need a simple lemma. 

Lemma 13. Consider a proper wexponential search tree of total weight W and an element of weight 
w. The element is used as a splitter at depth 0(lg ^^). 

Proof. As the tree is proper, its level is @{\glgW). On the other hand, if an element of weight 
w belongs to a subtree of level i, but is not chosen as a splitter there, then w < 2f{i), which is 
equivalent to £ = Q{lglgw). Hence the maximum possible difference between the level of the whole 
tree and the level of the subtree where the element is used as a splitter is 0(lg ^^^). • 

The above lemma shows that worst-case complexity of predecessor search in a wexponential search 
tree is 0(lg k li w ) ' where W is the total current weight and w is the weight of the predecessor 
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found. Indeed, consider this (unique) predecessor: it must be stored at depth 0(lg^^), and 
traversing each level requires one query to the static predecessor structure. The complexity of 
both insert and increase is more tricky to estimate, as we might need to repeat the expensive 
splitting procedure a couple of times. Nevertheless, insert traverses all O(lglgW^) levels, and 
increase traverses just 0(lg^^^) levels. At each of those levels we might need to split the tree, 
which takes a lot of time, but cannot happen very often. We start with an amortized bound. 

For each wexponential tree we maintain an invariant that we have at least max(0, W — f{i+ 1)) 
credits allocated there, where W is the current total weight and i is the level. As long as we don't 
split, the invariant is easy to maintain: whenever we move from a tree to its parent during an insert 
or increase and add one to its total weight, we put an additional credit there. Now consider splitting 
a tree of total weight W = 2f(£ + 1). We need to rebuild the static predecessor structures at both 
the tree (or, more precisely, at the two new trees) and its parent, hence we need to apply Prop, [s] 
to a set of size which we bounded in by + 1), which takes 0{f{i + 1)) time. On the other 
hand, we have W — f{i + 1) = f{i + 1) credits available, and for each of the two new trees we need 
to keep just max(0, /(£ + 1) + f{£) — f {£ + !)) = f{i) of them (recall that the larger of the new trees 
has weight at most f{£ + l) + f{tj). Hence we can use the remaining f{£ + l) — 2f{£) = Q{f {£ + !)) 
credits to pay for the reconstruction. 

As usual, deamortizing the running time requires more care. Fortunately, we can fairly closely 
follow the method of Bender, Cole, and Raman j6|. The only part of the implementation that is not 
worst-case efficient is splitting. Hence instead of immediately splitting a tree as soon as its weight 
becomes 2f{£ + 1), we perform it incrementally over ^f{£) updates concerning the tree or one of its 
descendants, starting when the weight becomes 2f{£ + 1). Furthermore, instead of refreshing the 
parent as soon as we have two new trees, we use the bidirectional pointer to replace the link to the 
old tree kept there by a record containing the element used for partitioning and the links to the 
new two trees. As long as the parent is not fully refreshed and the new trees are still linked from 
the same record, we call them twins. Similarly, refreshing is performed incrementally over ^f{£) 
updates concerning the tree or one of its descendants, starting whenever we notice that the weight 
is a multiple of ^f{£) not exceeding 2f{£ + 1) — ^f{£). This ensures that we never need to split a 
twin, as between splitting a child and splitting one of the two new children created as a result, the 
tree will be fully refreshed, hence we avoid a situation where we already keep a record containing 
two links, and now we would need to replace one of them by such a record again. Observe that we 
never try to refresh and split a tree at the same time, hence there is no interference between those 
two operations. 

Splitting requires first choosing a good splitter e^, which can be done in a single left-to-right 
sweep through the contents of the static predecessor search structure, and then building new static 
predecessor search structures containing {ei, 62, ... , ei_i} and {cj+i, . . . , ei^i}. This must be done 
in the background, hence it might happen that some updates concerning the already seen part 
occur. Hence we can only guarantee that the total weight of Pi and Si is at most f{£ + 1) + |/(^), 
as there might be up to ^f{£) of such updates. An additional complication is that the weight of 
Pi or Si might be so that we would expect the corresponding new tree to be undergoing refreshing 
at the moment, which is the reason we have chosen to refresh when the weight is a multiple of 
^f{£) instead of simply taking a multiple of f{£). We never skip two consecutive moments when 
we should start refreshing, as the next split starts not sooner than after f(£ + 1) - §/(£) > f{£) 
updates, hence it still never happens that we try to split a twin. 
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Rebuilding the static predecessor search structures in the background requires storing two ver- 
sions, old and new, at the same time. More precisely, we have the old version, which we use for 
navigation and answering any query, and the new one that is being built. The corresponding el- 
ements in both versions are linked to each other, and any update is performed in both of them. 
Then, when the new version is ready, we simply discard the old one in constant time. Discarding 
can be done by storing timestamps that can be used to determine which links are still valid, and 
which should be actually null at the moment. The definition of a wexponential tree must be slightly 
relaxed by saying that its current total weight never exceeds 2f{i + 1) + ^f{i), so the bounds on 
the complexity of search, and both insert and increase (excluding the cost of splitting) still hold, 
and the theorem follows. 



4.3 Amortized Version 

In this section we develop an amortized version of our dynamic compacted trie search structure. 

For each node v we keep two separate structures. The first is a static dictionary implemented 
using Prop. |6] that stores edges leading to all children Vi of size "similar" to the size of v. The second 



is a dynamic predecessor data structure implemented using Theorem 12 To make the notion of 
"similar size" more precise, define the weight of a node to be the number of leaves in its subtree, 
and its level to be i when the weight belongs to [f{i), 2f{£ + 1)) {[f{i), 3f{£ + 1)) in the worst-case 
version presented in the next section), where f{i) = 2^^^^ , again. Clearly, the weights and hence 
the levels along any root-to-leaf path are nonincr easing. We define the fragment of node v to be the 
maximal subtree containing v and consisting of nodes of the same level. The root r of a fragment 
containing v is its lowest ancestor of level i, but with parent of level at least i + 1. For each such 
fragment we store the root r and a list of bidirectional links to all its nodes, and a counter with the 
number of leaves in the subtree rooted at r. For each node v we store all edges leading to children 
of V of the same level £ in a static dictionary. All edges leading to children of smaller levels are 
kept in a wexponential search tree. We would like the weights in this structure to be the same as 
the weights of the corresponding nodes, but for technical reasons we maintain a weaker condition, 
namely a node of weight w has weight from [^/w, w] in the wexponential search tree stored at its 
parent. First we show that this relaxation is enough to guarantee good bounds on the search time. 

Lemma 14. Traversing any path of length m starting at the root takes just 0{m + i^i^f^^ ) time. 

Proof. At each node we first use the static dictionary to check if the next edge we would like to 
traverse is stored there. If so, we continue. Otherwise we query the wexponential search tree. In 
order to bound the complexity of the whole procedure, we only have to bound the total time of the 
latter steps, as the former sum to at most 0{m). Observe that whenever we query the wexponential 
search tree, we either terminate, or decrease the current level, which is already enough to get a bound 
of 0( /gig\^g'^ )- '^o set the claimed complexity, let Wi, W2, ■ ■ ■ , Wk be the weights of nodes where 
we query a wexponential search tree. Similarly, let wi,W2,---,Wk be the corresponding weights 
of nodes that we find there (note that Wi is not necessarily stored in our implementation, but Wi 
certainly is). Let be the weight of the elements of the wexponential search tree corresponding 
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to Wi, and the total weight of this structure. Then we have the following inequalities: 
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Whenever we add a new leaf, the weights of all nodes on its path to the root increase by one. 
We iterate over all fragments above the new leaf and increase their counters. Iterating is done 
in 0(lglgo") time by starting at the leaf and repeatedly jumping to the root of the current frag- 
ment by following the bidirectional link stored at each node. To maintain the invariant that the 
weights on any path are nonincreasing, we actually first construct a list of all fragments, and then 
update their counters one-by-one in a top-down order. For each root that we consider we need 
to update its corresponding weight in the wexponential search tree at its parent, which takes at 
most 0(lg ) = 0(1 + Ig ^^^) time, where w is the weight of this root, and W is at most the 
weight of its parent. Summing up over all roots, as in the proof of Lemma 14, we get a telescoping 
expression which is at most 0{lglga). 

During this procedure it might happen that we increase the weight of some root r to 2f{i + 1), 
and hence need to increase its level. Maintaining the invariant in such situation is a very costly 
procedure, and we need to somehow amortize this cost. We start at v := r and descend down to 
its (unique) child of weight exceeding f{£ + 1) as long as possible. 

Note that we don't actually store the weight of each node, but given a weight of v with exactly 
one child of level i, we can compute the weight of this child by iterating through all other children, 
which are roots of their fragments, and hence have up-to-date counters available. Note that if there 
is more than one child of level £, there cannot be any child of weight exceeding f(i + 1). We call 
the traversed path the tail, and increase the level of all its nodes, see Fig. |3j Then maintaining the 
invariants requires four steps. 



1. If the parent of r is of level i + 1, we must rebuild its static dictionary in order to include a 
new element there. 

2. We move all nodes on the tail from the list of the current fragment to either the list of the 
fragment corresponding to the parent of r, if its level is £ + 1, or a new fragment. 

3. If the last node on the tail has more than one child of level i, bumping its level to £ + 1 
splits the current fragment into more than one. We need to iterate through all nodes there, 
and partition them accordingly creating new fragments. Note that creating a new fragment 
requires computing the weights of their roots, which can be done by iterating through children 
of smaller levels for all nodes in the current fragment. 
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Figure 3: Bumping the levels of tail nodes. 



4. For each child of level ^ of the last node on the tail we must add a new element to the 
wexponential search tree. Note that at this point all those nodes are roots of their fragments, 
hence have their weights computed. 

Notice that all those steps are local in the sense that they modify only the nodes in the current 
fragment. To bound the time taken by the whole procedure, we allocate credits to fragments, 
making sure that a fragment of weight w and level ^ has max(0, — f{i + 1)) credits available. 
Whenever we split an edge and create a new leaf, we allocate one credit for each of the at most 
0(lglgo") fragments above. Then when we are increasing the level of r, its weight is 2/(^ + 1), so 
we have f{i + 1) credits available, and because of the way we defined the tail, we can spend all of 
them, as all new fragments of level I will be of weight at most f{l + 1) after the update. We can 
bound the time required for maintaining the invariants, which we call promoting at r, as follows. 

1. Rebuilding the static dictionary takes 0( j^q^ log^ log j^^) = ©(jjjlq^)- We call this 
refreshing the parent of r. 

2. There arc at most 4/(£+ 1) — 1 nodes in the subtree of r, hence traversing the tail, including 
the time taken to compute the weight of all nodes there, takes 0(/(£ + 1)). 

3. There are at most 2f{i + 1) nodes in the current fragment, hence the nodes in the current 
fragment can be partitioned into new fragments in 0{f{i + 1)) time. This includes the time 
necessary to compute the weights of their roots, as in the worst-case we iterate through at 
most 4/(^ + 1) — 1 nodes in the subtree of r. 

4. We must insert at most ^-^j^^"^-* elements into the wexponential tree stored at the last node of 
the tail, and inserting an element of weight w is done by first adding a new element of weight 
one, and then increasing its weight repeatedly ^/w times. 
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Hence the total number of credits required by the promoting is: 



Hence we only need to ensure that j^2|^!]]^| < f{i + ^), which is equivalent to f^{£ + 2) < f^{i+l), 
and then 2(|)^"'"^ < 3(|)^"'"^, hence by the choice of / we always have enough credits to amortize 
the update. 

4.4 Worst-Case Version 

In this section we develop the final worst-case efficient version of our dynamic compacted trie search 
structure. 

The only non worst-case efficient part of the previous implementation is increasing the level of 
a root r. Instead of traversing the tail and updating all the structures as soon as its level reaches 
2f{i + 1), we will execute those operations incrementally over the next f{i + 1) insertions in the 
subtree rooted at r, and relax the condition on the weight of a node of level i by saying that it 
shouldn't exceed 3f{£ + l). As selecting the tail is done incrementally, we redefine it to be the 
maximal sequence of nodes of weight at least s at the moment we started the process, which requires 
storing at each edge a timestamp denoting its creation time. 

First of all, we must make sure that there is at most one promoting process per fragment. 
Even though we have already chosen the tail, future insertions might increase the weight of some 
additional nodes to more than f{£ + l). Nevertheless, as we execute the procedure over just f{£ + l) 
insertions, no node in the current fragment (or in one of the new fragments) which doesn't belong 
to the tail can reach the weight of 2f{£ + 1) before we are done. Furthermore, there is some 
interaction between different fragments. While we are still promoting at r, new nodes might be 
added to the corresponding fragment. Also, different children of a node might need to be refreshing 
it at overlapping periods of time. 

We choose the speed of the simulation so that there is enough time to process a fragment con- 
sisting of 3f{£ + 1) nodes over f(£ + 1) insertions. When splitting the current fragment into more 
than one, we run a depth-first search to determine the new fragments. As soon as we reach a node, 
we set the link to its (new) fragment. Then when a new node appears, its parent is either already 
processed and hence has the correct link set, or will be seen later, and we will notice the new child 
then. The same reasoning works for inserting the new elements into the wexponential tree stored at 
the last node of the tail. Refreshing a node is a quite different issue, though, as we must somehow 
deal with the problem that many children might need to refresh the same parent. Each node of 
level £ stores a list of all its children of weight at least f{£), where we simply append a child as soon 
as its weight becomes large enough. As a part of our simulation we run the refreshing process. In 
other words, each child of a node can be potentially running a deferred process refreshing its parent. 
The first step of such process is making a read-only copy of the current list. As this list only grows, 
instead of making a physical copy we can simply store the current last element, which can be done 
in constant time. Then we build a new static dictionary containing all elements on this read-only 
copy over the insertions in the subtree rooted at the child. As soon as the construction finishes. 
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we substitute the old dictionary in constant time by replacing one pointer. There is just additional 
detail: we should first check if we are really replacing an older version, i.e., we can simply look at the 
number of elements stored there. This is because it might have happened that there is a refreshing 
process which starts later, yet finishes earlier (as there are more insertions to the corresponding 
subtree). This ensures that when the weight of r reaches 3/(^ + 1), the static dictionary stored a 
its parent surely contains its edge (and, potentially, also some more recent edges). 

5 Conclusions and Open Problems 

We gave data structures for improved deterministic worst-case searching and update times in suffix 
trees and compacted tries in general. Whereas previous data structures either ignored the alphabet 
size completely [4] or had at least a logarithmic dependency on the alphabet size, our new structures 
have only a doubly-logarithmic dependency on the alphabet size. The obvious open questions are 
if better running times are possible, or if non-trivial lower bounds exist? 

Appendix 

A Better suffix tree oracle 

In this section we show how to implement a suffix tree oracle with f{n, a) = 0(lglgn+ i^i^f^^ ) • At 
a high level, we follow the approach of Breslauer and Italiano [T], which in turn is based on Weiner's 
algorithm [is]. The main difficulty is that both methods are only efficient when the alphabet is of 
constant size, which is not our case, and some additional ideas are required. Nevertheless, we are 
able to achieve f{n,a) = 0(lglgn + Igo") by rather simple means. Then we decrease this bound 
even further by applying the structure of Kopelowitz |14| , which needs some minor tweaking as we 
are interested in a deterministic solution. 

For each node u of the suffix tree we define its o-link Waiu) if the suffix locus v = au exists. If 
V = au \s & node, then the link is called a hard a-link and points to this node, and otherwise a 
soft a-link and points to the lower end of the corresponding edge. Hard links correspond to inverse 
suffix links. Assuming that we maintain the suffix tree using Weiner's algorithm, the following 
properties hold. 

Observation 15. (a) If Wa{u) is defined and v is an ancestor of u, then Wa{v) is defined, too. 
In other words, the sets of links are monotone when we traverse the tree starting from the root. 

(h) As soon as Wa{u) becomes hard, it stays hard and cannot change. On the other hand, a soft 
link may change or become hard at some point. 

(c) Wa{u) is defined if and only if there is a descendant v of u with the hard a-link defined, and 
in fact a stronger statement holds: Wa{u) is defined iffWa{u) = Wa{v), where Wa{v) is a hard 
a-link, and v is the (unique) highest descendant of u with the hard a-link defined. (Uniqueness 
follows because if two nodes have a hard a-link defined, then their lowest common ancestor also 
has it.) 
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Breslauer and Italiano show that to implement a suffix tree oracle, one needs to store the suffix 
tree such that the following four operations can be performed: 

1. given a letter a and a node v, locate the lowest ancestor u oi v with the a-link defined, and 
retrieve its Wa{u), 

2. make a soft link hard, 

3. create or change a soft a-link, 

4. split an edge (n, v) of the suffix tree, and for each existing a-link Wa{v) (both soft and hard) 
create a new soft a-link Wa{v') := Wa{v), where v' is the new middle node and v is the lower 
end of the split edge. 

After prepending a single letter to the text a constant number of operations of each type is 
executed. The most problematic is the last one, as there might be many defined a-links from v. 
To overcome this, we will maintain the soft links just at some carefully chosen nodes. For the 
remaining nodes we will exploit (|c]) to recover the missing soft links whenever we need to access 
them. Furthermore, during the execution of the algorithm it might happen that we haven't yet 
updated some soft links. Nevertheless, we always update them in a top-bottom order, so Q still 
holds, and all hard links are eagerly created as soon as they should appear. The details are as 
follows. 

A.l Tree Decomposition 

We (again) divide the nodes into heavy and light, but this time we define the weight of a node to 
be the number of leaves in its subtree plus the number of hard links defined there (which due to 
(|b]) can only increase). We choose parameter s := a and say that all nodes of weight at least 3s are 
heavy, all nodes of weight at most s leaves are light, and the remaining nodes can be either light 
or heavy. 

Heavy nodes store all Wlinks in a dynamic dictionary (implemented using Prop. [9]), whereas light 
nodes store only the hard ones. Additionally, we maintain a structure which allows us to locate for 
a given heavy node v its lowest ancestor u with the a-link defined (point (1) above). The details 
on how to update this structure will be given later; we first describe how it can be used. We will 
consider the two possible cases where the answer is a heavy node or when it is light. 

A. 2 The Answer is a Heavy Node 

First we need a result on a special case of the marked ancestor problem. 

Lemma 16 (Fringe marked ancestor (FMA) structure |7j). We can maintain a tree T on n nodes 
which are either marked or unmarked under the following operations: 

1. inserting a new node in the middle of an edge, which adopts the marked status of its parent, 

2. inserting a new leaf which is initially unmarked, 
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3. marking the root or a node whose parent is already marked, 



4- locating the lowest marked ancestor of a given node. 

Due to the way marking is done the marked nodes form a connected subtree containing the root. 
Hence, the answer to queries on unmarked nodes will always lie on the fringe of this subtree. All 
operations cost O(lglgn) time which is faster than in the general setting where a lower bound 
of ^ dgf^yt ) ^'-'^ least one of the operations exists |lj 

We store a single FMA structure over the suffix tree, where all heavy nodes are marked. This 
allows us to locate for a given node x its lowest heavy ancestor v in O(lglgn) time. Because the 
heavy subtree can be quite large, we now need to do the following. Define the induced heavy subtree 
as the tree that contains only the heavy leaves and branching heavy nodes. We store this tree 
explicitly, and couple all edges of the original suffix tree with their corresponding edges in the 
induced tree by bidirectional links that allow us to move back and forth between the two trees. 

In this induced tree, for each letter a we keep a separate FMA structure, where a node is marked 
when its a-link is defined. Due to Q the marked nodes form indeed a connected subtree containing 



the root, so Lemma 16 can be applied. Additionally, for each edge of the induced tree we keep an 
array lowest [a] storing a pointer to the lowest heavy node (of the original suffix tree) that is coupled 
with the edge and that has the a-link defined. Due to Q this array gives us the only candidate on 
the edge of the induced tree. 

All FMA structures, arrays, and pointers to edges are enough to locate for a given heavy node v 
its lowest ancestor u with the a-link defined. First we check if u and v belong to the same edge of 
the induced tree. If not, we replace v by its lowest branching ancestor v' by going to the top node 
on the edge of the induced tree. Then we locate its lowest branching ancestor u' with the a-link 
defined using the FMA structure. Then if we look at the edge outgoing from u' and leading to the 
subtree v belongs to, u surely is there. The edge can be located by retrieving the corresponding 
letter, and return lowest [a]. The total time taken by those operations is 0(lglgn + i^i^f^^ ) , where 
the second addend is due to the edge retrieval. Furthermore, the total size of the structure is linear, 
as we need 0{a) space per edge, and we keep a FMA structures, each on O(^) nodes. 



A. 3 The Answer is a Light Node 

For each maximal subtree consisting of light nodes we store a separate structure allowing us to 
quickly locate for a given node v in this subtree its lowest light ancestor u with the a-link defined, 
and retrieve its Wa{u). By (|c]), we only have to maintain all hard links as long as given v we can 
quickly locate its highest descendant u with the hard a-link defined. Assume that we can maintain 
a preorder traversal of all nodes in the subtree. Then there is just one candidate for u: the first 
node after v in the traversal with the hard a-link defined. Hence if we maintain for each a a separate 
predecessor structure where we put all nodes in the subtree with the hard a-links defined, we just 
have to perform one predecessor query. In the simplest form, the structure could be any balanced 
search tree, giving us a 0(lga) time bound if we observe that comparing any two elements there 
reduces to computing the lowest common ancestor, which can be done in worst-case constant case 
even on dynamic trees j8j. 

We aim to achieve better bounds, though. For this we would like to implement the predecessor 
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structure using Prop. [9j but in order to do so we need to assign integer labels consistent with the 
preorder traversal to the nodes. This is exactly the POLP problem considered by Kopelowitz 14 . 



Lemma 17 (Predecessor search on dynamic subsets of an ordered dynamic list [14]). There exists 
a data structure for a dynamic ordered list of size n and disjoint subsets such that: 

1. order queries are answered in 0(1) worst-case time, 

2. creating a new node after a given node takes 0(1) worst-case expected time, 

3. adding an element into any subset takes O(lglgn) worst-case expected time, 
4- predecessor search in any subset takes O(lglgn) worst-case expected time. 

We apply his method with the following modifications. 

1. We aim to achieve deterministic time bounds, and hence need to replace y-fast tries by the 
structure from Prop. [9} In order to do so, we must verify that the following is possible: given 
a structure storing a set S, we can incrementally perform insertion into S over 0{lga) steps 
so that we can perform predecessor queries over the original S while the insertion is not over 
yet. But this is easy if the structure is an exponential search tree, which is just a multiway 
search tree where the insertion is done in a bottom-up order. Whenever we replace a node 
V hy v' , we store the pointer to the old v at v' together with a timestamp indicating when 
we should start using the new v' to answer predecessor queries. As there is just one insertion 
process per structure, we never need to store more than one previous version per node. 

2. As the data structure of Prop. |9] is of linear size, we do not need the bucketing as in the y-fast 
tries, which simplifies the structure a little bit as there is no interference between splitting of 
buckets and chunks. 

3. All sets stored in the structures should be disjoint to guarantee that whenever a label of some 
element changes, we need to update just a single element in one of them. For this we define 
our list to contain all pairs (u, a) such that Wa{u) is a hard link in the natural lexicographical 
order, i.e., extending the preorder on the nodes. 

As a result, we get both queries and updates in deterministic 0{ ^^^^f^^ ) worst-case time, where 
by update we mean either splitting an edge or creating a hard link. This is of course assuming that 
the weight of each light subtree stays at most s. 



A. 4 Updating the Structures 

As soon as the weight of some light subtree reaches 2s, we start the bumping process which is 
executed over the next s updates there. As in Section 4.3, we define the tail to be the maximal 
sequence of light nodes of weight at least s starting at the root of the subtree (again, by weight we 
actually mean the weight at the moment when we started the process) . We construct a list of those 
nodes, and rebuild the structures for all new smaller light subtrees. This takes 0(s) time because 
we defined the weight to include the number of hard links, and can be performed incrementally 



20 



over I updates keeping the structure for the original subtree as long as the procedure is not over 
yet so that we can answer queries. Then we must make all nodes on the tail heavy, which adds one 
edge to the tree induced by the branching heavy nodes, hence can potentially make at most one 
non-branching node branching. 

First of all, we must set pointers to this new edge and construct its array in 0{a) time. Then 
we might need to split an existing edge into two, which requires updating the pointers of all nodes 
there and constructing the array for the two new parts. Here we need to modify our definition of 
an edge slightly: we don't want its length to significantly exceed s. Hence we consider the tree 
induced by all branching nodes and some virtual branching nodes chosen so that each edge is of 
length at most |s and either connects at least one real branching node, or is of length at least 
|. In other words, whenever we have a long edge, we add a new virtual branching node in the 
middle. Now whenever we split an edge into two parts, we need to iterate over at most s nodes 
vi,V2, ■ ■ ■ ,Vk. For each of the two resulting parts we need to construct the arrays and change the 
pointers to the current edge. This needs to be done incrementally over the remaining | updates 
into the light subtree. The first difficulty is as soon as we modify the pointer to the current edge of 
some Vi, the array of this edge must be ready. So it seems that we should first construct the arrays, 
and then set the pointers. But then some new soft links could appear, and we wouldn't yet know 
which array should be updated. Hence we temporarily store pointers to both the old and the new 
edge at each node, updating both of them whenever a soft link changes or appears, and proceed in 
phases as follows. 

1. Initialize each of the new arrays to contain just nulls. 

2. Insert the new middle branching node into at most a FMA structures. 

3. Iterate over u^, . . . , and set the pointer to the new edge for each of them in this 
order. Furthermore, if Wa{vi) is defined but Wa{vi-\-i) is not, modify the value in the array 
corresponding to the current new edge in the same time step. To quickly detect all such a, 
for each heavy node maintain a linked list, and observe that creating a new soft link Wa{u) 
requires adding a new element to the list of u, and removing one element to the list of its 
parent. 

4. Update the array corresponding to the upper part of the edge by iterating over all letters. 

We execute this procedure incrementally over a number of updates to the light subtree. Mean- 
while, the queries are answered by checking both the old and the new edge. If the only thing 
happening were the updates to exactly one light subtree hanging off some Vi^ we could schedule 
the procedure over of them. Unfortunately, more than one subtree might grow too large at 

the same (or similar) time, and furthermore new nodes can appear on the edge. In other words, 
new branching nodes can appear, and the length of the edge can increase, forcing us to add new 
virtual branching nodes. Instead of running a separate process for each new branching node, we 
create one master process per edge and store a queue of nodes which should be made branching. 
The master process runs over | updates (each of them either increasing the weight of some subtree, 
or creating a new node on the edge), and starts as soon as we notice that the queue is nonempty 
or the length of the edge exceeds s. Then it makes a local copy of the current queue (clearing it 
afterwards), and splits the edge so that all nodes in the local queue are made branching and all 
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new edges consist of at most |s nodes. To achieve the latter, if the edge was of length at least s 
before the update, we insert one new virtual branching node in the middle, and then even if up to 
I new nodes appear, the length of each new edge is still below |s, but at least |. As a result we 
get a bunch of smaller edges with potentially nonempty queues, and we might need to run their 
master processes as soon as the first update to their parts of the tree happens. It is clear that this 
guarantees that the edges never become too long, but we must also verify that any node Vi is made 
branching after at most | updates to its light subtree, and this is why we have chosen the master 
process to run over | updates. At most | first updates to the light subtree are spent waiting for 
an already running process to finish, and then Vi belongs to some queue, so the next | updates will 
be used to make Vi (and possibly some other nodes) branching. 

This proves Thm. [5| 
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